Automatic Identification of Predicate Heads in Chinese Sentences

Authors

  • Xiaona Ren
  • Qiaoli Zhou
  • Chunyu Kit
  • Dongfeng Cai
Abstract

We propose an approach to automatically identifying predicate heads in Chinese sentences that combines statistical pre-processing with rule-based post-processing. In the pre-processing stage, the maximal noun phrases in a sentence are recognized and replaced with “NP” labels to simplify the sentence structure. A CRF model is then trained to recognize the predicate heads of the simplified sentence. In the post-processing stage, a rule base built from the grammatical features of predicate heads is used to correct the preliminary recognition results. Experiments show that the approach is feasible and effective, achieving an accuracy of 89.14% on the Tsinghua Chinese Treebank.
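The three stages described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the "H"/"O" label scheme, the toy tagger standing in for the trained CRF model, and the single correction rule are all hypothetical, and the maximal noun phrase spans are assumed to be supplied by an upstream recognizer.

```python
from typing import Callable, List, Tuple

Token = Tuple[str, str]   # (word, part-of-speech tag)
Span = Tuple[int, int]    # [start, end) token indices of a maximal noun phrase
Labels = List[str]        # "H" = predicate head, "O" = other (hypothetical scheme)


def collapse_noun_phrases(tokens: List[Token], np_spans: List[Span]) -> List[Token]:
    """Replace each maximal-NP span with a single ("NP", "NP") placeholder."""
    ends = {}
    for start, end in np_spans:
        if end <= start:
            raise ValueError("NP spans must be non-empty")
        ends[start] = end
    simplified: List[Token] = []
    i = 0
    while i < len(tokens):
        if i in ends:
            simplified.append(("NP", "NP"))
            i = ends[i]          # skip the whole noun phrase
        else:
            simplified.append(tokens[i])
            i += 1
    return simplified


def identify_predicate_heads(
    tokens: List[Token],
    np_spans: List[Span],
    tagger: Callable[[List[Token]], Labels],
    rules: List[Callable[[List[Token], Labels], Labels]],
) -> Tuple[List[Token], Labels]:
    """Simplify the sentence, tag it, then let each rule correct the labels."""
    simplified = collapse_noun_phrases(tokens, np_spans)
    labels = tagger(simplified)
    for rule in rules:
        labels = rule(simplified, labels)
    return simplified, labels


def toy_tagger(tokens: List[Token]) -> Labels:
    """Hypothetical stand-in for the CRF model: mark every verb as a head."""
    return ["H" if pos == "v" else "O" for _, pos in tokens]


def np_is_never_head(tokens: List[Token], labels: Labels) -> Labels:
    """Hypothetical rule: an NP placeholder is never labeled as a predicate head."""
    return ["O" if pos == "NP" else lab for (_, pos), lab in zip(tokens, labels)]


# "我们 的 老师 喜欢 中国 音乐" with two maximal noun phrases marked by span.
sentence = [("我们", "r"), ("的", "u"), ("老师", "n"),
            ("喜欢", "v"), ("中国", "ns"), ("音乐", "n")]
simplified, labels = identify_predicate_heads(
    sentence, [(0, 3), (4, 6)], toy_tagger, [np_is_never_head]
)
```

Passing the tagger and rules as plain callables keeps the sketch independent of any particular CRF library; a trained sequence model would slot in where `toy_tagger` is used.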

Similar resources

Representing Topic-Comment Structures in Chinese

Shi (2000) claims that topics must be related to a syntactic position in the comment, thus denying the existence of dangling topics in Chinese. Under Shi's analysis, the dangling topic sentences in Chinese are not topic-comment but subject-predicate sentences. However, Shi's arguments are not without problems. In this paper we argue that topics in Chinese can be licensed not only by a syntactic...

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...

Chinese Argument Extraction Based on Trigger Mapping

Unlike English, Chinese sentences do not have a strict syntactic structure, and ellipsis is a common phenomenon; both weaken the effectiveness of syntactic structure in argument extraction. In Chinese event extraction, many arguments cannot be extracted from the sentence successfully because of the loose connection between the nominal trigger and its arguments. This paper brings forward a n...

An Experimental Study on the Assignment of Focus Accent in Mandarin

This paper investigates the distribution of focus-related accents in the broad focus domain in Mandarin Chinese through 300 natural sentences. The results show that the focus-related accent tends to be assigned to the predicate in a subject-predicate structure, to the object in a predicate-object structure, and to the head in an adjunct-head structure unless the head is highly predictable. From th...

Three Sensitive Positions and Chinese Complex Sentences: A Comparative Perspective

The positioning of sentential connectives in Chinese complex sentences is more flexible than that of their counterparts in English. Sentential connectives in Chinese can be placed in three sensitive positions: clause-initial, predicate-initial, and clause-final. Due to the co-existence of prepositions and postpositions in the language, sentential connectives can be placed in both clause-initi...

Publication date: 2010